Categorical understanding using statistical ngram models

نویسندگان

  • Alexandros Potamianos
  • Giuseppe Riccardi
  • Shrikanth S. Narayanan
چکیده

In this paper, the speech understanding problem in the context of a spoken dialog system is formalized in a maximum likelihood framework. Word and dialog-state n-grams are used for building categorical understanding and dialog models, respectively. Acoustic con dence scores are incorporated in the understanding formulation. Problems due to data sparseness and out-of-vocabulary words are discussed. Incorporating dialog models reduces relative understanding error rate by 1525%, while acoustic con dence scores achieve a further 10% error reduction for a computer gam-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phrase and Ngram-Based Statistical Machine Translation System Combination

Multiples translations can be computed by one machine translation (MT) system or by different MT systems. We may assume that different MT systems make different errors due to using different models, generation strategies, or tweaks. An investigated technique, inherited from automatic speech recognition (ASR), is the so-called system combination that is based on combining the outputs of multiple...

متن کامل

Modeling of Long Distance Context Dependency

Ngram models are simple in language modeling and have been successfully used in speech recognition and other tasks. However, they can only capture the short distance context dependency within an n-words window where currently the largest practical n for a natural language is three while much of the context dependency in a natural language occurs beyond a three words window. In order to incorpor...

متن کامل

Recursive Path Models when Both Predictor and Response Variables are Categorical

Recursive path analysis is a useful tool for inference on a sequence of three or more response variables in which the causal effects of variables, if any, are in one direction. The primary objective in such analysis is to decompose the total effect of each variable into its direct and indirect components. Methods for recursive analysis of a chain of continuous variables are well developed but t...

متن کامل

Should substance use disorders be considered as categorical or dimensional?

AIMS This paper discusses the representation of diagnostic criteria using categorical and dimensional statistical models. Conventional modeling using categorical or continuous latent variables in the form of latent class analysis and factor (IRT) analysis has limitations for the analysis of diagnostic criteria. METHODS New hybrid models are discussed which provide both categorical and dimensi...

متن کامل

Head-Driven Parsing for Word Lattices

We present the first application of the head-driven statistical parsing model of Collins (1999) as a simultaneous language model and parser for largevocabulary speech recognition. The model is adapted to an online left to right chart-parser for word lattices, integrating acoustic, n-gram, and parser probabilities. The parser uses structural and lexical dependencies not considered by ngram model...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999